CDS

Accession Number TCMCG075C12944
gbkey CDS
Protein Id XP_017974740.1
Location complement(join(20401210..20401487,20401569..20401671,20401820..20401851,20401929..20402146,20402325..20402440,20402540..20402792,20402911..20402998,20403177..20403248,20403360..20403436,20403661..20403718,20403800..20404040))
Gene LOC18601929
GeneID 18601929
Organism Theobroma cacao

Protein

Length 511aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018119251.1
Definition PREDICTED: beta-glucosidase 44 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyl hydrolase 1 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00026        [VIEW IN KEGG]
R02558        [VIEW IN KEGG]
R02887        [VIEW IN KEGG]
R02985        [VIEW IN KEGG]
R03527        [VIEW IN KEGG]
R04949        [VIEW IN KEGG]
R04998        [VIEW IN KEGG]
R10035        [VIEW IN KEGG]
R10039        [VIEW IN KEGG]
R10040        [VIEW IN KEGG]
KEGG_rclass RC00049        [VIEW IN KEGG]
RC00059        [VIEW IN KEGG]
RC00171        [VIEW IN KEGG]
RC00262        [VIEW IN KEGG]
RC00397        [VIEW IN KEGG]
RC00451        [VIEW IN KEGG]
RC00714        [VIEW IN KEGG]
RC00746        [VIEW IN KEGG]
RC01248        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K05350        [VIEW IN KEGG]
EC 3.2.1.21        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00460        [VIEW IN KEGG]
ko00500        [VIEW IN KEGG]
ko00940        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
map00460        [VIEW IN KEGG]
map00500        [VIEW IN KEGG]
map00940        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGACGACCTTAGTAATTCTGTTGCTGCTGTCACTCACTTTGTTTAGCACTGCTTTGAACGCAAATGCCGATGCATTGTCCCATCTTAAGACACAGCGGTTGGACACCGGAGGCTTGAGCAGAGAGATTTTTCCGGAGGGTTTTGTGTTCGGGACGGCCACATCGGCATATCAAGTTGAGGGAATGGCTAGCAAGGAGGGTAGAGGACGTAGCATATGGGATGTCTTTGTCAATATTCCAGGAAATATTGTAGATAATGCTACAGGTGAAGTTTCTGTGGATCAGTATCACCATTATAAGGAAGATGTTAATCTGATGCATATGTTGAACTTTGATGCCTACCGGTTTTCGATCTCATGGCCCAGAATTTTTCCAAATGGGACAGGAAAGGTAAACTGGAAGGGAGTTGCCTACTACAATAGATTGATCAACGCCTTGCTTGAGAAAGGAATTACCCCATATGCGAATTTGTACCACTACGATCTCCCACTTGCACTTCAGGAGAAATATGGAGGTTTGCTGGGAGACCAAGTTGTGAAAGATTTTGCTGATTACGCAGATTTCTGTTTCAAGGCATTTGGTGATCGAGTAAAGAATTGGATGACATTCAATGAACCAAGGGTAATTGCTGCTCTTGGATTTGACAATGGCATCAATCCTCCTTGTAGATGTTCAAAGCCATTTGGAAATTGTACTGCTGGAGACTCTGCAACTGAGCCTTATATTGCAGCACATAATTTGATTTTAAGTCATGCTGAAGCTGCTAAAAGATACCGCGAAAAATATCAAACTAAACAGAAGGGAAGAATTGGAATCCTCTTGGACTTTGTTTGGTATGAACCTCTGACAAGAGGAAAGGCTGACAACTATGCAGCACAAAGAGCAAGAGACTTCCATATCGGATGGTTCTTGCACCCCCTTGTATATGGAGAATATCCAAAAACAATGCAAAATATTGTAGGAGAAAGGCTTCCAAAGTTCAGCAAAAGTGATGTCGAGACTGTGAAAAACTCCTTTGATTTTATTGGTATCAACCACTACACCTCTTTCTACATGTATGACCCGCATCAGCCTAAGCCCAATGTGACTGGTTACCAACAGGATTGGAATGTTGGGTTTGCTTTTGAACGTTGGGGAGAGCCAATTGGTCGTCGGGCTCACTCTGGATGGCTGTACCAAGTTCCATGGGGCATATACAAAGCTGTCACATACGTAAAAGAGCGTTATGGAAACCCCAATGTAATTCTCGCAGAAAATGGAATGGATAACCCTGGCAATGTCACATTTCCTGAAGCATTGTTCGATAGAGAAAGAGTAAATTACTATAGAAGCTACTTGAAGGAATTGAAGAGAGCTATGGATGATGGAGCCAATGTGACTGGCTACTTTGCTTGGTCATTGCTTGACAACTTTGAATGGCTTTTGGGTTATAGCTCTAGATTTGGCATGGTATACGTTGATTTCGAAACTCTCAAGAGGTACCCGAAGATGTCAGCTTACTGGTTCAAACAAATGCTTGAGAGAAAGCAGCAGTAG
Protein:  
MTTLVILLLLSLTLFSTALNANADALSHLKTQRLDTGGLSREIFPEGFVFGTATSAYQVEGMASKEGRGRSIWDVFVNIPGNIVDNATGEVSVDQYHHYKEDVNLMHMLNFDAYRFSISWPRIFPNGTGKVNWKGVAYYNRLINALLEKGITPYANLYHYDLPLALQEKYGGLLGDQVVKDFADYADFCFKAFGDRVKNWMTFNEPRVIAALGFDNGINPPCRCSKPFGNCTAGDSATEPYIAAHNLILSHAEAAKRYREKYQTKQKGRIGILLDFVWYEPLTRGKADNYAAQRARDFHIGWFLHPLVYGEYPKTMQNIVGERLPKFSKSDVETVKNSFDFIGINHYTSFYMYDPHQPKPNVTGYQQDWNVGFAFERWGEPIGRRAHSGWLYQVPWGIYKAVTYVKERYGNPNVILAENGMDNPGNVTFPEALFDRERVNYYRSYLKELKRAMDDGANVTGYFAWSLLDNFEWLLGYSSRFGMVYVDFETLKRYPKMSAYWFKQMLERKQQ